Regularization Methods for Canonical Correlation

نویسنده

  • Ying Xu
چکیده

Regularization Methods for Canonical Correlation Analysis, Rank Correlation Matrices and Renyi Correlation Matrices by Ying Xu Doctor of Philosophy in Statistics University of California, Berkeley Professor Peter J. Bickel, Chair In multivariate analysis, canonical correlation analysis is a method that enable us to gain insight into the relationships between the two sets of variables. It determines linear combinations of variables of each type with maximal correlation between the two linear combinations. However, in high dimensional data analysis, insufficient sample size may lead to computational problems, inconsistent estimates of parameters. In Chapter 1, three new methods of regularization are presented to improve the traditional CCA estimator in high dimensional settings. Theoretical results have been derived and the methods are evaluated using simulated data. While the linear methods are successful in many circumstances, it certainly has some limitations, especially in cases where strong nonlinear dependencies exist. In Chapter 2, I investigate some other measures of dependence, including the rank correlation and its extensions, which can capture some non-linear relationship between variables. Finally the Renyi correlation is considered in Chapter 3. I also complement my analysis with simulations that demonstrate the theoretical results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Regularization of Canonical Correlation Analysis

By elucidating a parallel between canonical correlation analysis (CCA) and least squares regression (LSR), we show how regularization of CCA can be performed and interpreted in the same spirit as the regularization applied in ridge regression (RR). Furthermore, the results presented may have an impact on the practical use of regularized CCA (RCCA). More specifically, a relevant cross validation...

متن کامل

Appendix: Multimodal Omics Data Integration Using Max Relevance-Max Significance Criterion

This paper presents a novel supervised regularized canonical correlation analysis, termed as CuRSaR, to extract relevant and significant features from multimodal high dimensional omics data sets [1]. The proposed method extracts a new set of features from two multidimensional data sets by maximizing the relevance of extracted features with respect to sample categories and significance among the...

متن کامل

Asymmetrically Weighted CCA And Hierarchical Kernel Sentence Embedding For Multimodal Retrieval

Joint modeling of language and vision has been drawing increasing interest. A multimodal data representation allowing for bidirectional retrieval of images by sentences and vice versa is a key aspect of this modeling. In this paper we show that a cross-view mapping of the search space to the query space achieves state of the art performance in bidirectional retrieval using off the shelf feature...

متن کامل

A pseudoproxy evaluation of the CCA and RegEM methods for reconstructing climate fields

14 Abstract 15 Canonical correlation analysis (CCA) is evaluated for paleoclimate field reconstructions in the context 16 of pseudoproxy experiments assembled from the millennial integration (850-1999 C.E.) of the National 17 Center for Atmopsheric Research Climate System Model 1.4. A parsimonious method for selecting 18 the order of the CCA model is presented. Results suggest that the method i...

متن کامل

Rough Hypercuboid Based Supervised Regularized Canonical Correlation for Multimodal Data Analysis

One of the main problems in real life omics data analysis is how to extract relevant and non-redundant features from high dimensional multimodal data sets. In general, supervised regularized canonical correlation analysis (SRCCA) plays an important role in extracting new features from multimodal omics data sets. However, the existing SRCCA optimizes regularization parameters based on the qualit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011